Automated confidence ranked classification of randomized controlled trial articles: an aid to evidence-based medicine

نویسندگان

  • Aaron M. Cohen
  • Neil R. Smalheiser
  • Marian S. McDonagh
  • Clement T. Yu
  • Clive E. Adams
  • John M. Davis
  • Philip S. Yu
چکیده

OBJECTIVE For many literature review tasks, including systematic review (SR) and other aspects of evidence-based medicine, it is important to know whether an article describes a randomized controlled trial (RCT). Current manual annotation is not complete or flexible enough for the SR process. In this work, highly accurate machine learning predictive models were built that include confidence predictions of whether an article is an RCT. MATERIALS AND METHODS The LibSVM classifier was used with forward selection of potential feature sets on a large human-related subset of MEDLINE to create a classification model requiring only the citation, abstract, and MeSH terms for each article. RESULTS The model achieved an area under the receiver operating characteristic curve of 0.973 and mean squared error of 0.013 on the held out year 2011 data. Accurate confidence estimates were confirmed on a manually reviewed set of test articles. A second model not requiring MeSH terms was also created, and performs almost as well. DISCUSSION Both models accurately rank and predict article RCT confidence. Using the model and the manually reviewed samples, it is estimated that about 8000 (3%) additional RCTs can be identified in MEDLINE, and that 5% of articles tagged as RCTs in Medline may not be identified. CONCLUSION Retagging human-related studies with a continuously valued RCT confidence is potentially more useful for article ranking and review than a simple yes/no prediction. The automated RCT tagging tool should offer significant savings of time and effort during the process of writing SRs, and is a key component of a multistep text mining pipeline that we are building to streamline SR workflow. In addition, the model may be useful for identifying errors in MEDLINE publication types. The RCT confidence predictions described here have been made available to users as a web service with a user query form front end at: http://arrowsmith.psych.uic.edu/cgi-bin/arrowsmith_uic/RCT_Tagger.cgi.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Effects of Acupuncture on Anxiety in Infertile Women: A Systematic Review of the Literature

Background & aim: Stress and anxiety due to waiting for treatment results and uncertainty of treatment success are common problems in infertile women. Acupuncture has been suggested as an effective strategy to relieve anxiety. This study aimed to review the available evidence on the effects of acupuncture on anxiety in infertile women. Methods: This systematic review was conducted via searching...

متن کامل

The level of evidence of published articles on orthodontics in PubMed journals from Iran during 2000-2015

BACKGROUND AND AIM: Evidence-based dentistry (EBD), including orthodontics, needs the availability and use of the high-quality studies. The aim of this study was to identify the level of evidence (LOE) of Iranian articles on orthodontics published in PubMed.METHODS: All the articles on orthodontics published from 2000 to 2015 in PubMed with Iran affiliations were extracted by typing orthodontic...

متن کامل

Adherence to the CONSORT Statement in the Reporting of Randomized Controlled Trials on Pharmacological Interventions Published in Iranian Medical Journals

Background: Among manuscripts submitted to biomedical journals, randomized controlled trials (RCTs) form the backbone of evidence-based medicine. Hence, their protocol should be designed rigorously and their results should be reported clearly. To improve the quality of RCT reporting, researchers developed the CONSORT Statement in 1996 and updated it in 2010. This study was designed to assess th...

متن کامل

Automating Biomedical Evidence Synthesis: RobotReviewer

We present RobotReviewer, an open-source web-based system that uses machine learning and NLP to semi-automate biomedical evidence synthesis, to aid the practice of Evidence-Based Medicine. RobotReviewer processes full-text journal articles (PDFs) describing randomized controlled trials (RCTs). It appraises the reliability of RCTs and extracts text describing key trial characteristics (e.g., des...

متن کامل

Comparing live lecture, internet-based & computer-based in-struction: A randomized controlled trial

  Background :Comparing computer and internet based instruction with traditional giving lecture would provide enough evidence to identify best teaching practice. In this study, we compared lecture, interactive internet based and computer based learning regarding medical students' knowledge acquisition and satisfaction in teaching pathophysiology of hematology and oncology.   Methods : Eighty fo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 22  شماره 

صفحات  -

تاریخ انتشار 2015